Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 262029 |
| Missing cells | 39896 |
| Missing cells (%) | 1.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 28.0 MiB |
| Average record size in memory | 112.0 B |
Variable types
| Categorical | 4 |
|---|---|
| DateTime | 1 |
| Numeric | 9 |
VERSIE has constant value "1.0" | Constant |
DATUM_BESTAND has constant value "2021-04-16" | Constant |
PEILDATUM has constant value "2021-04-01" | Constant |
TYPERENDE_DIAGNOSE_CD has a high cardinality: 1766 distinct values | High cardinality |
AANTAL_PAT_PER_ZPD is highly correlated with AANTAL_SUBTRAJECT_PER_ZPD | High correlation |
AANTAL_SUBTRAJECT_PER_ZPD is highly correlated with AANTAL_PAT_PER_ZPD | High correlation |
AANTAL_PAT_PER_DIAG is highly correlated with AANTAL_SUBTRAJECT_PER_DIAG | High correlation |
AANTAL_SUBTRAJECT_PER_DIAG is highly correlated with AANTAL_PAT_PER_DIAG | High correlation |
AANTAL_PAT_PER_SPC is highly correlated with AANTAL_SUBTRAJECT_PER_SPC | High correlation |
AANTAL_SUBTRAJECT_PER_SPC is highly correlated with AANTAL_PAT_PER_SPC | High correlation |
BEHANDELEND_SPECIALISME_CD is highly correlated with AANTAL_PAT_PER_SPC | High correlation |
AANTAL_PAT_PER_ZPD is highly correlated with AANTAL_SUBTRAJECT_PER_ZPD | High correlation |
AANTAL_SUBTRAJECT_PER_ZPD is highly correlated with AANTAL_PAT_PER_ZPD | High correlation |
AANTAL_PAT_PER_DIAG is highly correlated with AANTAL_SUBTRAJECT_PER_DIAG | High correlation |
AANTAL_SUBTRAJECT_PER_DIAG is highly correlated with AANTAL_PAT_PER_DIAG | High correlation |
AANTAL_PAT_PER_SPC is highly correlated with BEHANDELEND_SPECIALISME_CD and 1 other fields | High correlation |
AANTAL_SUBTRAJECT_PER_SPC is highly correlated with AANTAL_PAT_PER_SPC | High correlation |
AANTAL_PAT_PER_ZPD is highly correlated with AANTAL_SUBTRAJECT_PER_ZPD | High correlation |
AANTAL_SUBTRAJECT_PER_ZPD is highly correlated with AANTAL_PAT_PER_ZPD | High correlation |
AANTAL_PAT_PER_DIAG is highly correlated with AANTAL_SUBTRAJECT_PER_DIAG | High correlation |
AANTAL_SUBTRAJECT_PER_DIAG is highly correlated with AANTAL_PAT_PER_DIAG | High correlation |
AANTAL_PAT_PER_SPC is highly correlated with AANTAL_SUBTRAJECT_PER_SPC | High correlation |
AANTAL_SUBTRAJECT_PER_SPC is highly correlated with AANTAL_PAT_PER_SPC | High correlation |
JAAR is highly correlated with AANTAL_PAT_PER_SPC and 1 other fields | High correlation |
AANTAL_PAT_PER_ZPD is highly correlated with AANTAL_SUBTRAJECT_PER_ZPD | High correlation |
ZORGPRODUCT_CD is highly correlated with AANTAL_PAT_PER_SPC and 1 other fields | High correlation |
AANTAL_PAT_PER_SPC is highly correlated with JAAR and 2 other fields | High correlation |
AANTAL_PAT_PER_DIAG is highly correlated with AANTAL_SUBTRAJECT_PER_DIAG | High correlation |
AANTAL_SUBTRAJECT_PER_DIAG is highly correlated with AANTAL_PAT_PER_DIAG | High correlation |
AANTAL_SUBTRAJECT_PER_SPC is highly correlated with JAAR and 2 other fields | High correlation |
AANTAL_SUBTRAJECT_PER_ZPD is highly correlated with AANTAL_PAT_PER_ZPD | High correlation |
DATUM_BESTAND is highly correlated with PEILDATUM and 1 other fields | High correlation |
PEILDATUM is highly correlated with DATUM_BESTAND and 1 other fields | High correlation |
VERSIE is highly correlated with DATUM_BESTAND and 1 other fields | High correlation |
GEMIDDELDE_VERKOOPPRIJS has 39896 (15.2%) missing values | Missing |
AANTAL_SUBTRAJECT_PER_ZPD is highly skewed (γ1 = 20.88851715) | Skewed |
Reproduction
| Analysis started | 2021-05-11 22:12:58.341761 |
|---|---|
| Analysis finished | 2021-05-11 22:13:30.803813 |
| Duration | 32.46 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.0 MiB |
| 1.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 786087 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 262029 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 1.0 | 262029 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 262029 | |
| . | 262029 | |
| 0 | 262029 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 524058 | |
| Other Punctuation | 262029 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 262029 | |
| 0 | 262029 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 262029 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 786087 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 262029 | |
| . | 262029 | |
| 0 | 262029 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 786087 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 262029 | |
| . | 262029 | |
| 0 | 262029 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.0 MiB |
| 2021-04-16 |
|---|
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 2620290 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2021-04-16 |
|---|---|
| 2nd row | 2021-04-16 |
| 3rd row | 2021-04-16 |
| 4th row | 2021-04-16 |
| 5th row | 2021-04-16 |
Common Values
| Value | Count | Frequency (%) |
| 2021-04-16 | 262029 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2021-04-16 | 262029 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 524058 | |
| 0 | 524058 | |
| 1 | 524058 | |
| - | 524058 | |
| 4 | 262029 | |
| 6 | 262029 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2096232 | |
| Dash Punctuation | 524058 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 524058 | |
| 0 | 524058 | |
| 1 | 524058 | |
| 4 | 262029 | |
| 6 | 262029 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 524058 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2620290 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 524058 | |
| 0 | 524058 | |
| 1 | 524058 | |
| - | 524058 | |
| 4 | 262029 | |
| 6 | 262029 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2620290 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 524058 | |
| 0 | 524058 | |
| 1 | 524058 | |
| - | 524058 | |
| 4 | 262029 | |
| 6 | 262029 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.0 MiB |
| 2021-04-01 |
|---|
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 2620290 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2021-04-01 |
|---|---|
| 2nd row | 2021-04-01 |
| 3rd row | 2021-04-01 |
| 4th row | 2021-04-01 |
| 5th row | 2021-04-01 |
Common Values
| Value | Count | Frequency (%) |
| 2021-04-01 | 262029 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2021-04-01 | 262029 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 786087 | |
| 2 | 524058 | |
| 1 | 524058 | |
| - | 524058 | |
| 4 | 262029 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2096232 | |
| Dash Punctuation | 524058 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 786087 | |
| 2 | 524058 | |
| 1 | 524058 | |
| 4 | 262029 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 524058 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2620290 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 786087 | |
| 2 | 524058 | |
| 1 | 524058 | |
| - | 524058 | |
| 4 | 262029 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2620290 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 786087 | |
| 2 | 524058 | |
| 1 | 524058 | |
| - | 524058 | |
| 4 | 262029 | 10.0% |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.0 MiB |
| Minimum | 2012-01-01 00:00:00 |
|---|---|
| Maximum | 2021-01-01 00:00:00 |
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 423.0680192 |
| Minimum | 301 |
|---|---|
| Maximum | 8418 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 301 |
|---|---|
| 5-th percentile | 302 |
| Q1 | 305 |
| median | 313 |
| Q3 | 322 |
| 95-th percentile | 335 |
| Maximum | 8418 |
| Range | 8117 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 926.6493055 |
|---|---|
| Coefficient of variation (CV) | 2.190308091 |
| Kurtosis | 70.30172742 |
| Mean | 423.0680192 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 8.496364056 |
| Sum | 110856090 |
| Variance | 858678.9355 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 305 | 37255 | |
| 313 | 33849 | |
| 303 | 30150 | |
| 330 | 20906 | 8.0% |
| 316 | 17889 | 6.8% |
| 308 | 13615 | 5.2% |
| 306 | 10944 | 4.2% |
| 324 | 10913 | 4.2% |
| 301 | 10541 | 4.0% |
| 304 | 8512 | 3.2% |
| Other values (17) | 67455 |
| Value | Count | Frequency (%) |
| 301 | 10541 | 4.0% |
| 302 | 5689 | 2.2% |
| 303 | 30150 | |
| 304 | 8512 | 3.2% |
| 305 | 37255 | |
| 306 | 10944 | 4.2% |
| 307 | 4504 | 1.7% |
| 308 | 13615 | 5.2% |
| 310 | 2908 | 1.1% |
| 313 | 33849 |
| Value | Count | Frequency (%) |
| 8418 | 3466 | 1.3% |
| 1900 | 171 | 0.1% |
| 390 | 673 | 0.3% |
| 389 | 2829 | 1.1% |
| 362 | 3935 | 1.5% |
| 361 | 1842 | 0.7% |
| 335 | 2662 | 1.0% |
| 330 | 20906 | |
| 329 | 684 | 0.3% |
| 328 | 5704 | 2.2% |
| Distinct | 1766 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.0 MiB |
| 101 | 1099 |
|---|---|
| 402 | 1071 |
| 403 | 1039 |
| 301 | 1036 |
| 203 | 982 |
| Other values (1761) |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.35074362 |
| Min length | 2 |
Characters and Unicode
| Total characters | 877992 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 801 |
|---|---|
| 2nd row | 405 |
| 3rd row | 404 |
| 4th row | 399 |
| 5th row | 403 |
Common Values
| Value | Count | Frequency (%) |
| 101 | 1099 | 0.4% |
| 402 | 1071 | 0.4% |
| 403 | 1039 | 0.4% |
| 301 | 1036 | 0.4% |
| 203 | 982 | 0.4% |
| 201 | 979 | 0.4% |
| 401 | 879 | 0.3% |
| 404 | 865 | 0.3% |
| 802 | 858 | 0.3% |
| 409 | 848 | 0.3% |
| Other values (1756) | 252373 |
Length
| Value | Count | Frequency (%) |
| 101 | 1099 | 0.4% |
| 402 | 1071 | 0.4% |
| 403 | 1039 | 0.4% |
| 301 | 1036 | 0.4% |
| 203 | 982 | 0.4% |
| 201 | 979 | 0.4% |
| 401 | 879 | 0.3% |
| 404 | 865 | 0.3% |
| 802 | 858 | 0.3% |
| 409 | 848 | 0.3% |
| Other values (1756) | 252373 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 168223 | |
| 0 | 160507 | |
| 2 | 116339 | |
| 3 | 95299 | |
| 5 | 67325 | |
| 9 | 63507 | 7.2% |
| 4 | 62561 | 7.1% |
| 7 | 51682 | 5.9% |
| 6 | 46002 | 5.2% |
| 8 | 37696 | 4.3% |
| Other values (15) | 8851 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 869141 | |
| Uppercase Letter | 8851 | 1.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1644 | |
| M | 1477 | |
| B | 1059 | |
| E | 763 | |
| Z | 694 | |
| D | 606 | 6.8% |
| A | 580 | 6.6% |
| F | 564 | 6.4% |
| C | 296 | 3.3% |
| K | 284 | 3.2% |
| Other values (5) | 884 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 168223 | |
| 0 | 160507 | |
| 2 | 116339 | |
| 3 | 95299 | |
| 5 | 67325 | |
| 9 | 63507 | 7.3% |
| 4 | 62561 | 7.2% |
| 7 | 51682 | 5.9% |
| 6 | 46002 | 5.3% |
| 8 | 37696 | 4.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 869141 | |
| Latin | 8851 | 1.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 1644 | |
| M | 1477 | |
| B | 1059 | |
| E | 763 | |
| Z | 694 | |
| D | 606 | 6.8% |
| A | 580 | 6.6% |
| F | 564 | 6.4% |
| C | 296 | 3.3% |
| K | 284 | 3.2% |
| Other values (5) | 884 |
Common
| Value | Count | Frequency (%) |
| 1 | 168223 | |
| 0 | 160507 | |
| 2 | 116339 | |
| 3 | 95299 | |
| 5 | 67325 | |
| 9 | 63507 | 7.3% |
| 4 | 62561 | 7.2% |
| 7 | 51682 | 5.9% |
| 6 | 46002 | 5.3% |
| 8 | 37696 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 877992 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 168223 | |
| 0 | 160507 | |
| 2 | 116339 | |
| 3 | 95299 | |
| 5 | 67325 | |
| 9 | 63507 | 7.2% |
| 4 | 62561 | 7.1% |
| 7 | 51682 | 5.9% |
| 6 | 46002 | 5.2% |
| 8 | 37696 | 4.3% |
| Other values (15) | 8851 | 1.0% |
| Distinct | 5895 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 442145006.7 |
| Minimum | 10501002 |
|---|---|
| Maximum | 998418081 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 10501002 |
|---|---|
| 5-th percentile | 28999037 |
| Q1 | 99799063 |
| median | 149899002 |
| Q3 | 990004004 |
| 95-th percentile | 990416053 |
| Maximum | 998418081 |
| Range | 987917079 |
| Interquartile range (IQR) | 890204941 |
Descriptive statistics
| Standard deviation | 429377388.5 |
|---|---|
| Coefficient of variation (CV) | 0.9711234595 |
| Kurtosis | -1.743637859 |
| Mean | 442145006.7 |
| Median Absolute Deviation (MAD) | 119999995 |
| Skewness | 0.4610681944 |
| Sum | 1.15854814 × 1014 |
| Variance | 1.843649417 × 1017 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 990004009 | 1898 | 0.7% |
| 990003004 | 1879 | 0.7% |
| 990004007 | 1871 | 0.7% |
| 990004006 | 1518 | 0.6% |
| 990356076 | 1343 | 0.5% |
| 990356073 | 1246 | 0.5% |
| 990003007 | 1204 | 0.5% |
| 131999228 | 1141 | 0.4% |
| 131999164 | 1127 | 0.4% |
| 199299013 | 1087 | 0.4% |
| Other values (5885) | 247715 |
| Value | Count | Frequency (%) |
| 10501002 | 6 | |
| 10501003 | 9 | |
| 10501004 | 10 | |
| 10501005 | 9 | |
| 10501007 | 3 | < 0.1% |
| 10501008 | 9 | |
| 10501010 | 9 | |
| 10501011 | 3 | < 0.1% |
| 11101002 | 7 | |
| 11101003 | 9 |
| Value | Count | Frequency (%) |
| 998418081 | 128 | |
| 998418080 | 115 | |
| 998418079 | 34 | < 0.1% |
| 998418077 | 7 | < 0.1% |
| 998418076 | 6 | < 0.1% |
| 998418075 | 5 | < 0.1% |
| 998418074 | 161 | |
| 998418073 | 161 | |
| 998418072 | 6 | < 0.1% |
| 998418071 | 6 | < 0.1% |
AANTAL_PAT_PER_ZPD
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 9080 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 513.6010442 |
| Minimum | 1 |
|---|---|
| Maximum | 156439 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 14 |
| Q3 | 104 |
| 95-th percentile | 1744 |
| Maximum | 156439 |
| Range | 156438 |
| Interquartile range (IQR) | 101 |
Descriptive statistics
| Standard deviation | 3149.337782 |
|---|---|
| Coefficient of variation (CV) | 6.131875739 |
| Kurtosis | 384.8124431 |
| Mean | 513.6010442 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 16.34197142 |
| Sum | 134578368 |
| Variance | 9918328.467 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 43132 | 16.5% |
| 2 | 21126 | 8.1% |
| 3 | 13724 | 5.2% |
| 4 | 10247 | 3.9% |
| 5 | 7897 | 3.0% |
| 6 | 6650 | 2.5% |
| 7 | 5556 | 2.1% |
| 8 | 4702 | 1.8% |
| 9 | 4355 | 1.7% |
| 10 | 3801 | 1.5% |
| Other values (9070) | 140839 |
| Value | Count | Frequency (%) |
| 1 | 43132 | |
| 2 | 21126 | |
| 3 | 13724 | 5.2% |
| 4 | 10247 | 3.9% |
| 5 | 7897 | 3.0% |
| 6 | 6650 | 2.5% |
| 7 | 5556 | 2.1% |
| 8 | 4702 | 1.8% |
| 9 | 4355 | 1.7% |
| 10 | 3801 | 1.5% |
| Value | Count | Frequency (%) |
| 156439 | 1 | |
| 154821 | 1 | |
| 153882 | 1 | |
| 144701 | 1 | |
| 114370 | 1 | |
| 112195 | 1 | |
| 110013 | 1 | |
| 109108 | 1 | |
| 108960 | 1 | |
| 105337 | 1 |
AANTAL_SUBTRAJECT_PER_ZPD
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWED| Distinct | 9706 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 601.1570971 |
| Minimum | 1 |
|---|---|
| Maximum | 239907 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 15 |
| Q3 | 114 |
| 95-th percentile | 1981 |
| Maximum | 239907 |
| Range | 239906 |
| Interquartile range (IQR) | 111 |
Descriptive statistics
| Standard deviation | 3992.932227 |
|---|---|
| Coefficient of variation (CV) | 6.642077831 |
| Kurtosis | 698.4977423 |
| Mean | 601.1570971 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | 20.88851715 |
| Sum | 157520593 |
| Variance | 15943507.77 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 41564 | 15.9% |
| 2 | 20781 | 7.9% |
| 3 | 13589 | 5.2% |
| 4 | 10064 | 3.8% |
| 5 | 7822 | 3.0% |
| 6 | 6631 | 2.5% |
| 7 | 5525 | 2.1% |
| 8 | 4636 | 1.8% |
| 9 | 4261 | 1.6% |
| 10 | 3872 | 1.5% |
| Other values (9696) | 143284 |
| Value | Count | Frequency (%) |
| 1 | 41564 | |
| 2 | 20781 | |
| 3 | 13589 | 5.2% |
| 4 | 10064 | 3.8% |
| 5 | 7822 | 3.0% |
| 6 | 6631 | 2.5% |
| 7 | 5525 | 2.1% |
| 8 | 4636 | 1.8% |
| 9 | 4261 | 1.6% |
| 10 | 3872 | 1.5% |
| Value | Count | Frequency (%) |
| 239907 | 1 | |
| 232484 | 1 | |
| 231317 | 1 | |
| 227658 | 1 | |
| 221391 | 1 | |
| 218623 | 1 | |
| 208875 | 1 | |
| 203237 | 1 | |
| 202561 | 1 | |
| 202398 | 1 |
AANTAL_PAT_PER_DIAG
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 7926 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7743.301169 |
| Minimum | 1 |
|---|---|
| Maximum | 216998 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 43 |
| Q1 | 422 |
| median | 1744 |
| Q3 | 6534 |
| 95-th percentile | 36952 |
| Maximum | 216998 |
| Range | 216997 |
| Interquartile range (IQR) | 6112 |
Descriptive statistics
| Standard deviation | 17775.54068 |
|---|---|
| Coefficient of variation (CV) | 2.295602392 |
| Kurtosis | 32.39185591 |
| Mean | 7743.301169 |
| Median Absolute Deviation (MAD) | 1580 |
| Skewness | 4.959241839 |
| Sum | 2028969462 |
| Variance | 315969846.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 21 | 429 | 0.2% |
| 14 | 388 | 0.1% |
| 17 | 375 | 0.1% |
| 19 | 359 | 0.1% |
| 23 | 358 | 0.1% |
| 25 | 356 | 0.1% |
| 37 | 350 | 0.1% |
| 33 | 347 | 0.1% |
| 15 | 345 | 0.1% |
| 26 | 342 | 0.1% |
| Other values (7916) | 258380 |
| Value | Count | Frequency (%) |
| 1 | 330 | |
| 2 | 308 | |
| 3 | 301 | |
| 4 | 318 | |
| 5 | 285 | |
| 6 | 338 | |
| 7 | 330 | |
| 8 | 334 | |
| 9 | 326 | |
| 10 | 274 |
| Value | Count | Frequency (%) |
| 216998 | 23 | |
| 212133 | 25 | |
| 209818 | 19 | |
| 208372 | 17 | |
| 204232 | 17 | |
| 203537 | 17 | |
| 200180 | 16 | |
| 198508 | 20 | |
| 189111 | 19 | |
| 186555 | 20 |
AANTAL_SUBTRAJECT_PER_DIAG
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 8815 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10954.70991 |
| Minimum | 1 |
|---|---|
| Maximum | 347719 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 55 |
| Q1 | 552 |
| median | 2407 |
| Q3 | 9063 |
| 95-th percentile | 51707 |
| Maximum | 347719 |
| Range | 347718 |
| Interquartile range (IQR) | 8511 |
Descriptive statistics
| Standard deviation | 25947.3662 |
|---|---|
| Coefficient of variation (CV) | 2.368603681 |
| Kurtosis | 36.44294955 |
| Mean | 10954.70991 |
| Median Absolute Deviation (MAD) | 2203 |
| Skewness | 5.225350286 |
| Sum | 2870451682 |
| Variance | 673265812.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 25 | 316 | 0.1% |
| 32 | 302 | 0.1% |
| 18 | 297 | 0.1% |
| 17 | 294 | 0.1% |
| 38 | 290 | 0.1% |
| 11 | 285 | 0.1% |
| 19 | 284 | 0.1% |
| 34 | 284 | 0.1% |
| 1 | 279 | 0.1% |
| 22 | 278 | 0.1% |
| Other values (8805) | 259120 |
| Value | Count | Frequency (%) |
| 1 | 279 | |
| 2 | 244 | |
| 3 | 260 | |
| 4 | 254 | |
| 5 | 227 | |
| 6 | 278 | |
| 7 | 244 | |
| 8 | 216 | |
| 9 | 220 | |
| 10 | 271 |
| Value | Count | Frequency (%) |
| 347719 | 23 | |
| 345669 | 25 | |
| 340520 | 19 | |
| 323708 | 20 | |
| 305768 | 17 | |
| 298768 | 17 | |
| 296875 | 17 | |
| 288419 | 16 | |
| 267042 | 19 | |
| 265271 | 7 | < 0.1% |
AANTAL_PAT_PER_SPC
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 258 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 677488.3305 |
| Minimum | 3 |
|---|---|
| Maximum | 1489511 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 46186 |
| Q1 | 292896 |
| median | 748499 |
| Q3 | 995511 |
| 95-th percentile | 1345267 |
| Maximum | 1489511 |
| Range | 1489508 |
| Interquartile range (IQR) | 702615 |
Descriptive statistics
| Standard deviation | 411806.1862 |
|---|---|
| Coefficient of variation (CV) | 0.6078424788 |
| Kurtosis | -1.05883127 |
| Mean | 677488.3305 |
| Median Absolute Deviation (MAD) | 306271 |
| Skewness | 0.0118735588 |
| Sum | 1.775215898 × 1011 |
| Variance | 1.69584335 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 880967 | 5102 | 1.9% |
| 874259 | 4354 | 1.7% |
| 843996 | 4348 | 1.7% |
| 893038 | 4332 | 1.7% |
| 874979 | 4271 | 1.6% |
| 852295 | 4166 | 1.6% |
| 1084205 | 3891 | 1.5% |
| 1063762 | 3851 | 1.5% |
| 1077534 | 3847 | 1.5% |
| 1045611 | 3820 | 1.5% |
| Other values (248) | 220047 |
| Value | Count | Frequency (%) |
| 3 | 3 | < 0.1% |
| 4 | 5 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 2 | < 0.1% |
| 10 | 21 | |
| 12 | 3 | < 0.1% |
| 28 | 12 | |
| 32 | 6 | < 0.1% |
| 36 | 5 | < 0.1% |
| 57 | 25 |
| Value | Count | Frequency (%) |
| 1489511 | 2976 | |
| 1450632 | 3054 | |
| 1421864 | 3564 | |
| 1345267 | 3543 | |
| 1332918 | 3546 | |
| 1313951 | 3461 | |
| 1296736 | 1181 | 0.5% |
| 1283097 | 3577 | |
| 1263853 | 3389 | |
| 1262603 | 1201 | 0.5% |
AANTAL_SUBTRAJECT_PER_SPC
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 258 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1075098.901 |
| Minimum | 3 |
|---|---|
| Maximum | 2582911 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 49553 |
| Q1 | 483156 |
| median | 1088467 |
| Q3 | 1729116 |
| 95-th percentile | 2488921 |
| Maximum | 2582911 |
| Range | 2582908 |
| Interquartile range (IQR) | 1245960 |
Descriptive statistics
| Standard deviation | 714274.2511 |
|---|---|
| Coefficient of variation (CV) | 0.6643800405 |
| Kurtosis | -0.8748013266 |
| Mean | 1075098.901 |
| Median Absolute Deviation (MAD) | 631527 |
| Skewness | 0.2984517275 |
| Sum | 2.817070898 × 1011 |
| Variance | 5.101877057 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1211813 | 5102 | 1.9% |
| 1281549 | 4354 | 1.7% |
| 1216284 | 4348 | 1.7% |
| 1312993 | 4332 | 1.7% |
| 1290187 | 4271 | 1.6% |
| 1263165 | 4166 | 1.6% |
| 2557169 | 3891 | 1.5% |
| 2489027 | 3851 | 1.5% |
| 2582911 | 3847 | 1.5% |
| 2488921 | 3820 | 1.5% |
| Other values (248) | 220047 |
| Value | Count | Frequency (%) |
| 3 | 3 | < 0.1% |
| 4 | 5 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 2 | < 0.1% |
| 10 | 21 | |
| 12 | 3 | < 0.1% |
| 28 | 12 | |
| 32 | 6 | < 0.1% |
| 36 | 5 | < 0.1% |
| 57 | 25 |
| Value | Count | Frequency (%) |
| 2582911 | 3847 | |
| 2557169 | 3891 | |
| 2489027 | 3851 | |
| 2488921 | 3820 | |
| 2184663 | 3757 | |
| 2066406 | 3810 | |
| 1978605 | 3691 | |
| 1962349 | 1166 | 0.4% |
| 1943198 | 2976 | |
| 1939022 | 1160 | 0.4% |
| Distinct | 3182 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 39896 |
| Missing (%) | 15.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3522.892952 |
| Minimum | 20 |
|---|---|
| Maximum | 287220 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 140 |
| Q1 | 465 |
| median | 1245 |
| Q3 | 4080 |
| 95-th percentile | 13290 |
| Maximum | 287220 |
| Range | 287200 |
| Interquartile range (IQR) | 3615 |
Descriptive statistics
| Standard deviation | 6596.672459 |
|---|---|
| Coefficient of variation (CV) | 1.872515728 |
| Kurtosis | 167.6851786 |
| Mean | 3522.892952 |
| Median Absolute Deviation (MAD) | 1010 |
| Skewness | 7.813549912 |
| Sum | 782550780 |
| Variance | 43516087.53 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 105 | 1855 | 0.7% |
| 160 | 1831 | 0.7% |
| 110 | 1465 | 0.6% |
| 180 | 1382 | 0.5% |
| 120 | 1326 | 0.5% |
| 185 | 1306 | 0.5% |
| 300 | 1267 | 0.5% |
| 145 | 1265 | 0.5% |
| 140 | 1211 | 0.5% |
| 500 | 1159 | 0.4% |
| Other values (3172) | 208066 | |
| (Missing) | 39896 | 15.2% |
| Value | Count | Frequency (%) |
| 20 | 1 | < 0.1% |
| 70 | 226 | 0.1% |
| 75 | 75 | < 0.1% |
| 80 | 361 | 0.1% |
| 85 | 929 | |
| 90 | 492 | 0.2% |
| 95 | 522 | 0.2% |
| 100 | 981 | |
| 105 | 1855 | |
| 110 | 1465 |
| Value | Count | Frequency (%) |
| 287220 | 8 | |
| 148910 | 3 | < 0.1% |
| 142855 | 4 | |
| 122155 | 4 | |
| 116765 | 3 | < 0.1% |
| 109745 | 7 | |
| 108570 | 7 | |
| 107655 | 4 | |
| 101270 | 8 | |
| 95465 | 7 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| VERSIE | DATUM_BESTAND | PEILDATUM | JAAR | BEHANDELEND_SPECIALISME_CD | TYPERENDE_DIAGNOSE_CD | ZORGPRODUCT_CD | AANTAL_PAT_PER_ZPD | AANTAL_SUBTRAJECT_PER_ZPD | AANTAL_PAT_PER_DIAG | AANTAL_SUBTRAJECT_PER_DIAG | AANTAL_PAT_PER_SPC | AANTAL_SUBTRAJECT_PER_SPC | GEMIDDELDE_VERKOOPPRIJS | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1.0 | 2021-04-16 | 2021-04-01 | 2012-01-01 | 324 | 801 | 131999206 | 219 | 235 | 941 | 1050 | 237383 | 280015 | 285.0 |
| 1 | 1.0 | 2021-04-16 | 2021-04-01 | 2012-01-01 | 324 | 405 | 131999207 | 168 | 168 | 11623 | 13281 | 237383 | 280015 | 600.0 |
| 2 | 1.0 | 2021-04-16 | 2021-04-01 | 2012-01-01 | 324 | 404 | 131999118 | 4 | 4 | 908 | 1026 | 237383 | 280015 | 825.0 |
| 3 | 1.0 | 2021-04-16 | 2021-04-01 | 2012-01-01 | 324 | 399 | 131999010 | 39 | 39 | 2122 | 2485 | 237383 | 280015 | 2230.0 |
| 4 | 1.0 | 2021-04-16 | 2021-04-01 | 2012-01-01 | 324 | 403 | 131999084 | 8 | 8 | 1774 | 1963 | 237383 | 280015 | 1110.0 |
| 5 | 1.0 | 2021-04-16 | 2021-04-01 | 2012-01-01 | 324 | 711 | 131999206 | 9 | 9 | 39 | 43 | 237383 | 280015 | 285.0 |
| 6 | 1.0 | 2021-04-16 | 2021-04-01 | 2012-01-01 | 324 | 316 | 131999208 | 24 | 24 | 127 | 153 | 237383 | 280015 | 400.0 |
| 7 | 1.0 | 2021-04-16 | 2021-04-01 | 2012-01-01 | 324 | 604 | 131999207 | 1 | 1 | 48 | 58 | 237383 | 280015 | 600.0 |
| 8 | 1.0 | 2021-04-16 | 2021-04-01 | 2012-01-01 | 324 | 309 | 131999154 | 1 | 1 | 9443 | 11467 | 237383 | 280015 | NaN |
| 9 | 1.0 | 2021-04-16 | 2021-04-01 | 2012-01-01 | 324 | 109 | 131999208 | 47 | 47 | 285 | 347 | 237383 | 280015 | 400.0 |
Last rows
| VERSIE | DATUM_BESTAND | PEILDATUM | JAAR | BEHANDELEND_SPECIALISME_CD | TYPERENDE_DIAGNOSE_CD | ZORGPRODUCT_CD | AANTAL_PAT_PER_ZPD | AANTAL_SUBTRAJECT_PER_ZPD | AANTAL_PAT_PER_DIAG | AANTAL_SUBTRAJECT_PER_DIAG | AANTAL_PAT_PER_SPC | AANTAL_SUBTRAJECT_PER_SPC | GEMIDDELDE_VERKOOPPRIJS | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 262019 | 1.0 | 2021-04-16 | 2021-04-01 | 2018-01-01 | 327 | 0118 | 990027140 | 4 | 4 | 930 | 1533 | 199707 | 370698 | NaN |
| 262020 | 1.0 | 2021-04-16 | 2021-04-01 | 2018-01-01 | 327 | 0512 | 990027209 | 1 | 1 | 2291 | 4416 | 199707 | 370698 | NaN |
| 262021 | 1.0 | 2021-04-16 | 2021-04-01 | 2018-01-01 | 327 | 0216 | 990027144 | 10 | 10 | 3210 | 6204 | 199707 | 370698 | NaN |
| 262022 | 1.0 | 2021-04-16 | 2021-04-01 | 2018-01-01 | 327 | 0115 | 990027136 | 51 | 51 | 15618 | 25587 | 199707 | 370698 | 26050.0 |
| 262023 | 1.0 | 2021-04-16 | 2021-04-01 | 2018-01-01 | 327 | 0516 | 990027200 | 40 | 46 | 1342 | 2944 | 199707 | 370698 | NaN |
| 262024 | 1.0 | 2021-04-16 | 2021-04-01 | 2018-01-01 | 327 | 0314 | 990027160 | 1820 | 2409 | 5172 | 9858 | 199707 | 370698 | 3160.0 |
| 262025 | 1.0 | 2021-04-16 | 2021-04-01 | 2018-01-01 | 327 | 0216 | 990027135 | 1 | 1 | 3210 | 6204 | 199707 | 370698 | 40545.0 |
| 262026 | 1.0 | 2021-04-16 | 2021-04-01 | 2018-01-01 | 327 | 0715 | 990027188 | 20 | 20 | 7641 | 12851 | 199707 | 370698 | NaN |
| 262027 | 1.0 | 2021-04-16 | 2021-04-01 | 2018-01-01 | 327 | 0315 | 990027159 | 118 | 140 | 1075 | 2061 | 199707 | 370698 | 9500.0 |
| 262028 | 1.0 | 2021-04-16 | 2021-04-01 | 2018-01-01 | 327 | 0415 | 990027164 | 22 | 22 | 2444 | 4274 | 199707 | 370698 | 20170.0 |